Picture for Tong Yang

Tong Yang

Henan Polytechnic University

A Primer in Post-Training Reasoning Data: What We Know About How It Works

Add code
Jun 01, 2026
Viaarxiv icon

Agentic Transformers Provably Learn to Search via Reinforcement Learning

Add code
May 29, 2026
Viaarxiv icon

ConMoE: Expert-Pool Consolidation via Prototype Reassignment for MoE Compression

Add code
May 28, 2026
Viaarxiv icon

ESPO: Early-Stopping Proximal Policy Optimization

Add code
May 28, 2026
Viaarxiv icon

Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

Add code
May 27, 2026
Viaarxiv icon

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

Add code
May 22, 2026
Viaarxiv icon

Formal Skill: Programmable Runtime Skills for Efficient and Accurate LLM Agents

Add code
May 19, 2026
Viaarxiv icon

Multi-Depth Uniform Coverage Path Planning for Unmanned Surface Vehicle Surveying

Add code
May 13, 2026
Viaarxiv icon

Thinking with Reasoning Skills: Fewer Tokens, More Accuracy

Add code
Apr 23, 2026
Viaarxiv icon

EdgeFormer: local patch-based edge detection transformer on point clouds

Add code
Apr 23, 2026
Viaarxiv icon